Prediction of Intrinsically Disordered Proteins Using Machine Learning Based on Low Complexity Methods

نویسندگان

چکیده

Prediction of intrinsic disordered proteins is a hot area in the field bio-information. Due to high cost evaluating regions protein sequences using experimental methods, we used low-complexity prediction scheme. Sequence complexity this scheme calculate five features for each residue sequence, including Shannon entropy, Topo-logical Permutation entropy and weighted average values two propensities. Particularly, first time that permutation has been applied sequencing. In addition, data preprocessing stage, an appropriately sized sliding window comprehensive oversampling can be improve performance our scheme, ensemble learning algorithms are also verify results before after. The show adding improves algorithm, which MCC value improved from original 0.465 0.526 proving its universality. Finally, compare simulation with those some existing schemes demonstrate effectiveness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequence Complexity of Amyloidogenic Regions in Intrinsically Disordered Human Proteins

An amyloidogenic region (AR) in a protein sequence plays a significant role in protein aggregation and amyloid formation. We have investigated the sequence complexity of AR that is present in intrinsically disordered human proteins. More than 80% human proteins in the disordered protein databases (DisProt+IDEAL) contained one or more ARs. With decrease of protein disorder, AR content in the pro...

متن کامل

Intrinsically disordered proteins.

Our understanding of protein function has been predominated by the view that proteins need to adopt a defined three dimensional structure to be able to carry out their function. Indeed, crystal structures of numerous proteins have been instrumental in establishing the structure– function paradigm. For example, the structures of numerous enzymes have highlighted the need for particular chemical ...

متن کامل

Intrinsically Disordered Proteins

In the same way that neither a messy lab bench nor a clean one is a reliable indicator of a researcher’s productivity, a protein’s function cannot be judged solely on the basis of its neatly folded and stable domains. As evidenced by recent work discussed in this Select, we are learning that intrinsically disordered regions feature in many of the cell’s most productive multitaskers, proteins wh...

متن کامل

CECAM workshop on Intrinsically Disordered Proteins

With the increasing need to integrate different areas of science in the study of intrinsically disordered proteins we arranged a meeting entitled “Intrinsically Disordered Proteins: Connecting Computation, Physics and Biology” in Zürich in September 2013. The aim of the meeting was to bring together scientists from a range of disciplines to provide a snapshot of the field, as well as to promote...

متن کامل

Intrinsically Disordered Proteins: Methods for Structure and Dynamics Studies

Intrinsically disordered proteins (IDPs) partially or completely lack a cooperatively folded structure under native conditions, preventing their equilibrium state from being adequately described by a single structural model. Reaching the goal of quantitatively describing IDP structure–function relationships has required the development of novel experiments aiming to characterize their structure...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Algorithms

سال: 2022

ISSN: ['1999-4893']

DOI: https://doi.org/10.3390/a15030086